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Abstract 


One approach to qualitative reasoning about dynamical systems is 
to extract qualitative information by searching or making queries on 
databases containing very large numbers of trajectories. The efficiency 
of such queries depends crucially upon finding an appropriate data 
structure for trajectories of dynamical systems. 

Suppose that a large number of parameterized trajectories 7 of 
a dynamical system evolving in R^ are stored in a database. Let 
r] C R n denote a parameterized path in Euclidean space, and let || • || 
denote a norm on the space of paths. In this paper, we define a data 
structure to represent trajectories of dynamical systems and sketch an 
algorithm which answers queries of the following form: 

Query. Return the trajectory 7 from the database which minimizes 
the norm 

lh-711- 




1 Introduction 


1.1 Queries about Trajectories 

Suppose that a large number of parameterized trajectories 7 of a dynamical 
system evolving in K N are stored in a database. Let 77 C R^ denote a 
parameterized path in Euclidean space, and let || • || denote a norm on the 
space of paths to be specified later. In this paper, we define a data struc- 
ture to represent trajectories of dynamical systems and sketch algorithms to 
answer queries of the following forms: 

Query 1 . Return the trajectory 7 from the database which minimizes the 
norm 

ll»7- 711- 

Query 2. Fix e > 0. Return all trajectories 7 from the database which 
satisfy 

ll»7-7|| < c- 

Query 3. If 77 is a segment of trajectory stored in the database, return the 
entire trajectory 7. 

Efficient algorithms to answer these type of queries should prove useful 
for a number of applications. As an example, consider the path-planning 
problem for a robotic arm. Suppose that a large number of feasible tra- 
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jectories of the robotic arm have been stored in a database. Let 77 be the 
desired path of the arm. It is not necessary that 77 be an actual feasible 
path. Query 1 would return the feasible trajectory 7 of the arm which is 
closest to the desired path 77. 

An another example, assume the database contains trajectories not from 
just one dynamical system, but from a parameterized family of dynamical 
systems. Let 7 denote a trajectory which is a periodic orbit. Then Query 2 
could be used to gain information about those dynamical systems which also 
have a periodic orbit. In other words, queries such as these would be useful 
in extracting qualitative information about dynamical systems. 

As a final example, consider a database containing control trajectories 
for an aircraft, and assume that auxiliary information is attached to each 
control trajectory in the database. For example, those trajectories which 
enter into an unstable control regime somewhere along their flight path 
could be tagged. Let 77 denote a measured portion of the flight path of the 
aircraft. Then Query 1 would return the nearest full control trajectory in 
the database, which would include information about the stability of the 
trajectory. In other words, the query could be used as part of a supervisory 
control system. 

A database supporting these types of queries on trajectories needs an 
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efficient means of storing, accessing, searching, and comparing trajectories. 
In this paper, we describe a data structure for trajectories, a related family of 
index functions for trajectories, and sketch algorithms to answer the queries 
above. The work described in this paper is preliminary. For a more complete 
analysis of the issues raised here, a complexity analysis of the algorithms, 
and a discussion of implementation issues, see [9]. 

The data structure we describe is closely related to hashing methods for 
curves that have been used in computer vision; see [13], [16], and [11]. A 
related means of extracting qualitative information from dynamical systems 
is described in [1] and [2]. 

In Section 2 we review the relevant facts about trajectories of differential 
equations and define different data structures to store trajectories. In Sec- 
tion 3, we show how these data structures can be used to answer the queries 
above. Section 4 contains some concluding remarks. The remainder of this 
introduction discusses this work from the viewpoint of extensible databases. 

1.2 Extensible Databases 

Imagine an extensible relational database supporting the basic data type of 
trajectory, as well as the more traditional data types of string, integer, float, 
etc. The inclusion of this new data type also requires that new operators be 
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supported, new storage and access methods be developed, and new methods 
of optimizing queries be found. 

For example, it is important that the database support the storage and 
retrieval of trajectories of an arbitrarily large size. It is also important that 
operators defined on the entire trajectory be supported, such as operators 
returning various temporal and spatial averages. Our viewpoint is to look 
for specific data structures and algorithms to handle the storage, access, 
and querying of trajectories. This is not the only viewpoint that has been 
proposed. 

Another viewpoint is to change the relational model as little as possible 
and build a complete extensible DBMS on top of it. For example, Stone- 
breaker [14] describes a mechanism for a user to register new abstract data 
types into the University version of Ingres. Because of the built-in access 
methods used by Ingres, the new data types must occupy a fixed amount 
of space. Still another viewpoint is to provide a modular and modifiable 
system on top of which extensible databases for specific applications can be 
built. The Exodus DBMS described in [7] is an example of such an approach. 
See [6] for a recent survey covering some of these issues and detailing other 
approaches. 

The queries above would not make very much sense for a traditional 
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relational database. On the other hand, queries such as these are natural 
queries for an extensible relational database of trajectories. They are also 
typical of the queries found in computational geometry; see [12]. Query 1 is 
an example of a nearest neighbor query, while Query 2 is an example of a 
fixed-radius near neighbor query. Bentley [4] considers both of the queries 
for the case of points in Euclidean space. 

It is also important to understand that while Query 3 does not make 
sense in general for spatial curves, it is meaningful for trajectories of dif- 
ferential equations. Recall that a trajectory of a differential equation is 
determined once the initial condition and defining equation are specified; 
see [3], for example. This means, in principle, that given any point on the 
trajectory, that point together with the defining equation is sufficient to de- 
termine the entire trajectory. Alternatively, given a sequence of points on 
the trajectory which is sufficient to specify the form of the differential equa- 
tion defining the dynamical system, the entire trajectory can be obtained. 
It is for these reasons that Query 3 makes sense for trajectories of dynamical 
systems. 
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2 Representations of Paths 

2.1 Trajectories 

In this section, we recall some basic facts and defintions about trajectories 

of differential equations. Let D M = d/dx M . A vector field 

N 

M=1 

on r n is determined by specifying N functions 

: R N — R. 

We also denote the vector field by E a ■ In this paper, we will consider only 
vector fields with polynomial coefficients; that is, the functions ai,...,a;v 
are all polynomials in X \ , . . . , xjq. A parameterized path 

7 : [t 0 ,* 1 ] CR-R N 

is called a trajectory of the dynamical system 

x(t) = E a {x{t)) (1) 
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in case it is the unique solution of the intitial value problem 

i(<) = E a (x(t)), x(t°) = 7 (t°). ( 2 ) 

We distinguish between various representations of trajectories. 

Spatial Curve Representation. In this representation, the trajectory is 
viewed simply as the set of points in which comprise it. 

Parameterized Path Representation. In this representation, the tra- 
jectory is viewed as a map 

7 : [/V 1 ] C R — + RA. 

We may pass from the spatial curve representation to the parameter- 
ized path representation by parameterizing the curve using arclength. 

Vector Field/Reference Point Representation. In this representation, 
we identify the trajectory with a pair ( E,R ), consisting of a vector 
field E and a reference or initial point R, where the trajectory is the 
solution of the initial value problem 

x{t) = E(x(t)), x(t°) = R. 

Note that the Vector Field/Reference Point representation, or VEFREP, 
is not unique. Indeed, several different vector fields could have a given spatial 
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curve as a trajectory, while any point along the spatial curve could serve as 
the initial value. Even so, as we will see in later sections, this representation 
is convenient for query processing. 

2.2 Rectifying Trees 

In this section, we give an algorithm whose input is a parameterized path 

*?:[<V] CR-R n , 

and whose output is a labeled, rooted binary tree. We assume for conveniece 
that t° = 0 and t 1 = 1; if not, we can reparameterize. We do not assume 
that rj is a trajectory of the dynamical system (1). This tree will be used in 
later sections to define polynomials fli,...,a 7 v with the property that rj is 
close to a trajectory of the dynamical system (1). 

To define the tree, we first fix a tolerance e > 0. The tree we define is a 
subset of the complete rooted binary tree. The height of a node is defined 
to be the length of the unique path connecting the node to the root. The 
children of the root have height 1, their children height 2, etc. There are 2 k 
children of height k : number them left to right from 1 to 2 k . We assign two 
labels to the jth node v from the left at height k: 

= (l/2‘) (■)(£)- £R" 
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and 


e ( v ) = v( L yr) eR" 

We use the following stopping criterion to grow the tree. If a node has 
children v and v* with labels k and respectively, and if 

||« — K ; || < 6 , 

then the nodes v and v l are leaves. Here || • || denotes the Euclidean norm. 
We denote by T{rf) the tree that arises in this fashion. This tree has a simple 
interpretation: the 6 labels represent points on the path 77, while the k labels 
represent approximate tangent vectors at those points. The tree is grown 
until the difference between two adjacent tangent vectors is uniformly small. 

2.3 Trees and Interpolating Vector Fields 

In the last section, we associated a tree T{rj) to each path 77 C H N . In 
this section, we associate a vector field E(rj) to each path using the tree. 
The vector field E(rj) is simply the vector field with polynomial coefficients 
which interpolates the labels ( 0 (u),tj(u)) 

E( V )(6(v)) = k(v), (3) 

for all leaves v in Recall that 0(v) is the point on the curve 77 corre- 

sponding to the node v, and k(v) is the approximate tangent to the curve 
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at that point. 

Suppose that the tree corresponding to the path rj has K leaves. Once 
we fix a vector space basis of K monomials 


■*' , . . . , X , 


the equations (3) define a vector field 

N 

E a = 

m=i 

where ai(x), . . . , a/v(x) are polynomial functions of the form 

K 

a* = bffz 1 *’ 
j = i 

and the bj are scalars. The equations (3) reduce to linear equations for the 
scalars bj . To write down these equations, let v \ , . . . , vr denote the K leaves 
of the tree T(rj), and for i = 1 ...,N denote the IV-components of 

the vector k(vj). Then the system (3) becomes 


x |o(v,) 


V 


*01 


1(11,, 1 ' 




K^(Vl) 

!•(.«) ) 




V ^(VK) ) 


for [i = 1 , ... ,1V. 

The number of leaves of the tree T(tj) depends very strongly on the 
geometry of the path 77 . As the complexity of the curve grows, so does 
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the degree of the polynomial coefficients of the vector field E(t]). For some 
applications, it is better to impose an upper bound on the degree of the 
polynomial coefficients of the vector field E(t /). Let q denote this bound. In 
this case, we can define the vector field E(rj) by requiring that the coefficients 
bj minimize the quantity 

E ii£(i)(*w) - «wn, m 

leaves v 

where the minimum is over vector fields with polynomial coefficients of de- 
gree less than or equal to q. See Davis [8] for explicit means of finding the 
coefficients bj satisfying these equations. 

2.4 Reference Points 
Let 

r / : [t 0 ,* 1 ] C R — + 

be a parametrized path, T(tj) the corresponding tree, and E(q) the asso- 
ciated vector field with polynomial coefficients. Consider the trajectory 
defined by the initial value problem 

X(t) = E a (x(t)), X(0) = T](t°). 

Let 7 denote this trajectory. In general, 7 is only an approximation to the 
path 77. Define R(rj) G as follows: if the path 7 and the unit sphere in 
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R^ intersect, let R(t]) denote this intersection; otherwise, let -R(rj) denote 
the closest point between the unit sphere and the trajectory 7. 

To summarize, given a path 77 C R' V , we have defined a rectification 
tree T( 77), a vector field E(r)), and a reference point R(rj) G R^, with the 
property that there is a segment of the trajectory 7 which approximates the 
path 77, where 7 is the solution of the initial value problem 

x(t) = E(ff)(x(t)) t z( 0 ) = R(r 7 ). (5) 

The pair (E(tj), R(t])) is called the VEFREP representation of the path 77. 

It is now easy to define an index or invariant associated with the path 
77. First, fix injective functions 

h n :R n — {1,2,...}, 

for each n = 1,2 , .. . Next, view the coefficients of the vector field E( 77) as a 
K ■ N vector, so that the pair (E(t]), R(r])) has (K + l)N components. Then 
the index associated with 77 is defined by 

h(v) = h(K+i)N(E(rj), R(t]))- 

This can be used to hash the trajectory; see [9]. 
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3 Returning the Nearest Trajectory 
3.1 Proximity Problems 

In this section, we show that the VEFREP representation of a path reduces 
Queries 1 and 2 to the following queries: 

Query 1 '. Given P points in R^, preprocess the points in time Preprocess(P) 
so that given a query point we can determine in time Query(P) the point of 
the original set which is closest to it, using storage Storage(P). 

Query 2'. Given P points in R^, preprocess the points in time Preprocess(P) 
so that given a query point we can determine in time Query(P) all points 
of the original set which are within a distance of <! of it, using storage 
Storage(P). 

Query 1' is called the nearest neighbor problem, and Query 2' is called 
the fixed radius near neighbor problem. For a general reference to proximity 
problems such as these, see [12]. This reference also describes algorithms 
to solve these types of problems using techniques which we do not mention 
here, such as Voronoi diagrams. The naive exhaustive algorithm requires 

Storage(P) = 0(P) 

Query(P) = O(QP). 
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In [4], Bentley shows, with the assumption of sparsity, how to use a multi- 
dimensional divide-and-conquer algorithm to achieve bounds 

Preprocess(,P) = 0(Plog^ _1 P) 

Storage(P) = OfPlog^P) 

Query(P) = O(log g P). 

In [5], Bentley et. al., use a cell method and the assumption of sparsity to 
achieve expected bounds of 

Preprocess(P) = 0(P) 

Query(P) = 0(1), 

assuming that the original points are drawn independently from a uniform 
distribution of a bounded region. 

Both of these algorithms have constants which grow exponentially with 
the dimension. Because of this, it is doubtful whether either one could lead 
to practical algorithms to answer Queries 1 and 2. 

3.2 The Algorithm 

In this section, we sketch an algorithm to answer Queries 1-3. For further 
details, see [9]. 
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Suppose that 71, ... ,7 p are trajectories of the dynamical system 

x(tf) = E a (x(t)), 

as a ranges over some parameter space. To each such trajectory 7, let 

(£( 7), *(7)) 

denote its VEFREP representation. We define a norm on the space of 
trajectories via 

|| 7 || = ||(i?(7),^))ll, 

where the norm on the right hand side is the standard Euclidean norm. 
Given a parameterized path 77, Algorithm 1 below returns from the database 
the trajectory 7 which contains a segment which is closer to 77 than any other 
segment from any other trajectory in the database. Notice that this provides 
an answer to both Query 1 and Query 3. Query 2 is handled in an analogous 
manner; again, see [9] for details. 

Algorithm 1* The input is a parameterized path 7), and the output is the 
trajectory 7 from the database answering Query 1. Fix e > 0 and q > 1. 
Compute the vector field E of the VEFREP representations (E,R) in the 
algorithm using Equation 4. 

Step 1. This step is a precomputation. For each trajectory 7;, i = 1, . . . , P, 
compute its VEFREP representation (£(7*-), J2(7,-)). 
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Step 2. Given a query path 77, compute its VEFREP representation ( E(rj ), #(*7)). 

Step 3 . Solve the nearest neighbor problem for query point (£*(77), 72(77)), 
given the set of points 




Let (E,R) denote the resulting nearest neighbor. 


Step 4. Compute the trajectory 7 which is the solution to the initial value 
problem 

x(t) = E(x(t )), x(0) = R . 


4 Conclusion 

In this paper, we have described preliminary work concerned with queries 
of databases containing trajectories of differential equations. Our eventual 
goal is to build an extensible, relational database supporting these types of 
queries. 

Trajectories of differential equations have many different representations. 
For the types of queries considered here, we have chosen to represent pa* 
rameterized trajectories 

7 : [t 0 ,* 1 ] CR — ♦ K n 


18 



by a pair, consisting of a vector field E on R^ with polynomial coefficients 
and a point R 6 R^ such that the trajectory is the solution of the initial 
value problem: 

x(t) = E(x(t )), x(t°) = R . 

We call this a VEFREP representation. 

Using the VEFREP representation, we have reduced Query 1 and Query 2 
to standard problems in the computational geometry of points in Euclidean 
space, the nearest neighbor problem, and the near neighbor problem, re- 
spectively. 

Much work remains to be done: the efficiency and implementation of 
the algorithm needs to be studied. We do not claim that the algorithm 
described here is necessarily the most practical for querying large databases 
of trajectories. On the other hand, it is simple enough that its properties 
can easily be understood and investigated. It is our hope that this will lead 
to more practical algorithms. 
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